Multipartite Nonlocal Quantum Correlations Resistant to Imperfections 
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We use techniques for lower bounds on communication to derive necessary conditions in terms 
of detector efficiency or amount of super-luminal communication for being able to reproduce with 
classical local hidden- variable theories the quantum correlations occurring in EPR-type experiments 
in the presence of noise. We apply our method to an example involving n parties sharing a GHZ-type 
state on which they carry out measurements and show that for local-hidden variable theories, the 
amount of super-luminal classical communication c and the detector efficiency r\ are constrained by 
r j2~ c / n = 0(n~ 1//6 ) even for constant general error probability e = O(l). 
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I. INTRODUCTION 

Forty years ago, Bell introduced the notion of 
"quantum nonlocality" : he showed that the correlations 
between the outcomes of measurements carried out on 
entangled quantum systems cannot be reproduced by a 
local classical theory (often called a local hidden variable 
theory) . Since then, extensive work has been carried out 
on quantum nonlocality, both on the experimental and 
theoretical aspects. On the theory side research on quan- 
tum nonlocality has branched out into many different and 
complementary directions. 

One important direction of investigation is the search 
for qualitatively different types of quantum nonlocality. 
Of particular interest is the discovery of the Greenberger- 
Horne-Zeilinger (GHZ) paradox In this and related 

examples, correlations are characterized as nonlocal by 
the pattern of zero and nonzero joint probabilities. This 
property has been called "pseudo telepathy," because in 
every run of the experiment, the parties appear to agree 
clandestinely on a subset of admissible outputs. It should 
be contrasted with other examples where it is the values 
of these joint probabilities which implies nonlocality. 
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Another important advance was to show that quantum 
nonlocality subsists even in the presence of noise as first 
demonstrated by Clauser, Home, Shimony, and Holt [j]. 
This is essential since every experimental test will neces- 
sarily be affected by imperfections; the best experiments 
to date have error rates of the order of a few percent. 
Much additional work has been devoted to understanding 
the resistance of quantum nonlocality to imperfections. 

In experiments involving entangled photons, there is 
one particular kind of imperfection that plays a central 
role, namely the small efficiency of single-photon detec- 
tors. A single-photon detector will register the presence 
of a photon with probability 77, and will not register the 
presence of the photon with probability 1 — 77. For in- 
stance, as one goes from visible to infrared wavelengths, 
77 decreases from more than 50% to 10%. Detector ineffi- 
ciency can be thought of as a specific type of noise. This 
imperfection was first discussed by Pearle |5j and remains 
to this day one of the major hurdles to overcome in order 
to carry out a loophole-free test of quantum nonlocal- 
ity. Examples show that there are quantum correlations 
that are highly insensitive to detector inefficiency, but 
are much more sensitive to other kinds of noise, see Mas- 
sar 6] , and therefore this kind of imperfection should be 
studied independently of other kinds of noise. 

Note that the complementary error, namely detectors 
clicking when they should not, can also occur. We con- 
sider this error as a general noise, as it cannot be distin- 
guished from other types of noise such as non-maximally 
entangled states. 

The development of quantum information theory over 
the past ten years (see |jf for a review) has brought 
a breath of fresh air to the study of quantum nonlo- 
cality, and important new questions have been raised. 
For instance Bell showed that the quantum correla- 
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tions could not be reproduced classically without "super- 
luminal" communication between the parties. But Bras- 
sard et al. @ and Steiner Q initiated the study of how 
much super-luminal communication is required to repro- 
duce the correlations. This question is closely related 
to quantum communication complexity, in which one en- 
quires whether certain distributed communication tasks 
can be solved using less quantum communication than 
is required classically; see |lOj for a survey of quantum 
communication complexity. 

Remarkably, the amount of classical communication 
required to reproduce the quantum correlations and the 
minimum detector efficiency required to close the detec- 
tion loophole are closely related quantities as demon- 
strated by Gisin and Gisin ^l| , and Massar Q . In many 
cases, quantum correlations that require a lot of com- 
munication to reproduce classically cannot be simulated 
classically without communication, even when the actual 
detectors are very inefficient. 

Another question that has been raised in the context 
of quantum information theory concerns the asymptotic 
limit when the size of the entangled system grows. Does 
the gap between classical and quantum correlations grow, 
and if so, at what rate? Brassard et al. [8j showed that 
in the bipartite case the amount of communication re- 
quired to classically reproduce the quantum correlations 
can increase exponentially with the number of entangled 
bits shared by the parties. And it follows from the results 
by Buhrman et al. 12] that there are quantum correla- 
tions for n parties each holding a two-dimensional sub- 
system, so that the amount of communication that must 
be broadcast in a classical simulation increases logarith- 
mically with the number of parties. Unfortunately these 
asymptotic results have only been proved in the total 
absence of noise. 

The only prior asymptotic results in quantum com- 
munication complexity that hold in the presence of 
noise concern multi-round quantum communication pro- 
tocols, such as the appointment-scheduling problem of 
Buhrman et al. ^| or the example due to Raz 0]. It 
appears that these results cannot be mapped to results 
concerning quantum nonlocality, whereas communication 
complexity problems with a single round of communica- 
tion and nonlocal quantum correlations can generally be 
mapped one onto the other. 

The present work lies at the intersection of these dif- 
ferent lines of enquiry. Specifically we concentrate on 
the generalization of the GHZ paradox to n parties pre- 
viously considered by Buhrman et al. |T^ : their bounds 
on the GHZ-inspired multiparty communication problem 
was only proved in the absence of noise. We extend it to 
the noisy case. 

These GHZ-type correlations involve n parties. We 
suppose that there is a fixed (i.e., independent of n) 
nonzero probability e for an error to occur. Denote by c 
the number of bits communicated (via a possibly super- 
luminal channel) in order to reproduce the correlations. 



We show that 

c = ri(rtlogn) . 

Denote by r/* the maximum detector efficiency for which 
a local classical model exists. We show that 

r,* = 0(n- 1/6 ) . 

In fact, the superluminal communication and detection 
efficiency can be traded one for the other: we combine 
the above two results into the following bound: 

v * 2 -o/n = o (n~ 1/a ) . (1) 

This bound sheds new light on the relation between these 
two quantities, which was previously discussed in 0, Il5| . 
Our result constitutes to our knowledge the first example 
in which the degree to which the quantum correlations 
are nonlocal increases with the size of the entangled sys- 
tem in the presence of noise and as such constitute a sig- 
nificant advance in our understanding of quantum com- 
munication complexity and of quantum nonlocality. 

The present work builds upon the earlier results of 12] 
and . As in these references we rely heavily on tech- 
niques and ideas from the field of communication com- 
plexity. The reader unfamiliar with these notions may 
consult the book by Kushilevitz and Nisan 0] for an 
introduction to classical communication complexity. 

The remainder of this article is organized as follows. In 
Section[n]we define precisely the main concepts of nonlo- 
cality used in this paper. In Section IlIII we introduce the 
combinatorial notion of monochromatic rectangles and 
prove a general relation between 77, c, and e, which de- 
pends on the maximum size of almost monochromatic 
rectangles. This general result is of interest in its own 
right and could be of use when studying other instances 
of quantum nonlocality that exhibit pseudo telepathy. In 
Section Hvl we apply the general bound to the GHZ para- 
dox; the proof of Eq. (JIJ is based on an addition theorem 
for cyclic groups proved in Section Finally, we discuss 
our results and open problems in Section IVII 

II. NONLOCALITY DEFINITIONS 

Consider the following situation. There are n spa- 
tially separated parties; party i receives an input Xi £ 
{1, . . . , k} and produces an output o» G {1, . . . , I}. With 
x = (x\, . . . , x n ) and a — (a\, . . . , a n ), let P(a\x) denote 
the probability of output a given input x. The inputs 
are distributed according to the probability distribution 
fi(x). We formalize this situation as follows. 

Definition 1 An (n,k,£) correlation problem with in- 
put distribution fi is a family of probability distributions 
P(-|x) on the "outputs" {1, ...,£} n , for each "input" 
x £ {1, ... , k} n with /z(x) > 0. We denote the support of 
/J, by D :— {x : /i(x) > 0}. 
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Note that in nonlocality experiments the distribution fi 
should be a product distribution, otherwise the parties 
would have trouble selecting x according to [i when the 
measurements take place in timelike separated regions. 
On the other hand the mathematical proofs given below 
and in particular the example of Section llVl are based on 
non-product distributions. The way to get around this 
is the following: during the nonlocality experiment the 
inputs are distributed according to a product distribution 
fio, for instance the uniform distribution. Then when 
analyzing the data one first throws away part of the data 
in such a way that, for the data that is kept, the inputs 
are distributed according to the desired distribution jjL. 
This can only make the task harder for the local hidden 
variable theory, since it does not know before hand which 
runs will be kept and which will be thrown away. From 
now on we let /x be an arbitrary (possibly non-product) 
distribution. 

We are interested in correlation problems obtained 
from measurements on multipartite entangled quantum 
states. We define these as follows. 

Definition 2 An (n, k, £) measurement scenario is a cor- 
relation problem in which the parties share an entangled 
state \ip); each input Xi determines a positive operator 
valued measure (POVM) X{ = {x\, . . . ,xf} with x\ > 0, 
&j = !»• If the measurement of party i produces 
outcome x\, then it outputs <2j = j. The probability 
Pqm(o-\x) to obtain outcome a given input x is 

Pqm(ci\x) = (i^lxl 1 <g>-..® x£"|V>) • 

Our aim is to study what classical resources are re- 
quired to reproduce such measurement scenarios. Let us 
first consider classical models in which the parties cannot 
communicate after they have received the inputs. Such 
models are called local. The best the parties can do in 
this case is to randomly select in advance a deterministic 
strategy. This motivates the following definition. 

Definition 3 A deterministic local hidden variable (lhv) 
model is a family of functions A = (Ai, . . . , A n ) from the 
inputs to the outputs: Xi : {1, . . . , k} —* {1, . . . ,£}. Each 
party outputs dj = Xi(xi). 

A probabilistic lhv model (or just lhv modeL) is a prob- 
ability distribution v{X) over all deterministic lhv models 
for given (n,k,£). 

Thus in probabilistic lhv models the parties first ran- 
domly choose a deterministic lhv model A using the 
probability distribution v. Each party then outputs 
a t = Xi(xi). 

We also consider classical models with communication. 
In such models, the parties may communicate over a pos- 
sibly superluminal classical broadcast channel in order 
to reproduce the quantum correlations Pqm- Different 
communication models exist depending on whether the 
parties do not have access to randomness, possess local 
randomness only, or share randomness. These notions 



are adapted from the corresponding definitions in com- 
munication complexity. 

Definition 4 Consider n parties who each receive an in- 
put Xi <= {1, . . . , k} 7 communicate over a classical broad- 
cast channel, and each produce an output dj € {1, . . ., 
£}. 

A deterministic classical model with communication is 
a rooted "communication protocol" tree V; each internal 
node u is labeled with the party i u S {l,...,n} whose 
turn it is to broadcast a message; each edge e from u 
to a descendant is labeled with a set X e C {1, . . . , k} so 
that the X e form a partition of {1, . . . , k}; each leaf v is 
labeled with a lhv model X v . An execution of the protocol 
on input x starts at the root of tree; until a leaf is reached, 
the execution proceeds from node u to the descendant of 
u that is reached via the edge e with x\ n G X e . It is 
understood that the choice of the edge is broadcast to all 
parties so that all parties know at each moment at which 
node the execution is. When the execution has reached 
the leaf v, each party i outputs X Vi i(xi) and the execution 
terminates. If there are m leaves and if the number of 
children of the nodes on the path from the root to the 
final leaf is t\,...,t m , the number of bits broadcast is 
c = [logti] H h [logt m ] . 

A classical model with shared randomness is an ar- 
bitrary probability distribution v(P} over deterministic 
classical models. An execution of such a model first prob- 
abilistically selects a deterministic model and then eval- 
uates the deterministic model. 

In a classical model with local randomness, the distri- 
bution v(V) is constrained to be a product distribution of 
the individual strategies of the parties. 

Of course, a classical model that always uses bits of 
communication is just a lhv model. 

Definition 5 For a correlation problem P with input 
distribution /i, we denote by D(P), R(P), and R puh (P), 
respectively, the minimum number of bits that must be 
broadcast in order to perfectly reproduce the correlations 
P when the parties are deterministic, have local random- 
ness only, or have shared randomness. 

Where the choice of the correlation problem P is clear 
from the context, we drop it and write D, R, and R puh . 

Clearly, D(P) > R(P) > R puh (P). Since the results of 
quantum measurements are inherently random, it is in 
in general impossible to reproduce the quantum correla- 
tions using deterministic lhv models or using determinis- 
tic models with communication. Thus D{P) is meaning- 
less when trying to simulate quantum measurement sce- 
narios. However, deterministic models are a very useful 
tool for studying the probabilistic models because prop- 
erties of all deterministic models necessarily also hold for 
all probabilistic models, since the probabilistic models 
are just probabilistic mixtures of deterministic models. 
Note also that Massar et al. showed that R(P) can 
be infinite when P arises from a quantum measurement 
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scenario. In general, classical models cannot reproduce 
the quantum correlations Pqm unless communication is 
possible, the detector efficiency 77 is sufficiently small, or 
the error probability is sufficiently large. 

Let us consider now the situation where the detec- 
tors are inefficient. In this case we enlarge the space 
of outputs to a,i € {1, . . . , £} U {J-}, where a, =_L is the 
event that the ith detector does not produce an output 
("click"). We suppose that each measurement Xi has 
probability r\ of giving a result and a probability 1 — r\ of 
not giving a result. Whether a detector clicks or does not 
click is independent of the other detectors. This affects 
the probabilities in a more structured way than simply 
decreasing the probability that all detectors click simul- 
taneously. This issue has been discussed by Massar and 
Pironio |18j ; for simplicity we will consider here only the 
two extreme cases, namely that all detectors click (which 
occurs with probability rf 1 ) or that at least one detector 
does not click. We define detector efficiency accordingly. 

Definition 6 Let P(-\x) be a fixed (n,k,£) correlation 
problem with input distribution fi. Let 

C := {a : Vi Oj 

denote the output vectors where all detectors click. With 
slight abuse of notation, we also use C as the indicator 
random variable of the event a € C . We define the detec- 
tion efficiency r\ of the correlations to be the expectation 



:= E„ 



Y, P (a\x)C 



"IN V™ 



Note that here the atomic events are tuples (x, a) of an 
input and an output vector with a joint distribution of 
the form Pr[input x and output a] — ^i{x)P{a\x). The 
expectation above is over the marginal distribution fi of 
the inputs. 

We are also interested in the possibility that the 
lhvmodel makes errors. 

Definition 7 Suppose that some classical model pro- 
duces a probability distribution P(a\x), which should ap- 
proximate the probability distribution produced by a mea- 
surement scenario Pqm{o\x). The total-variation dis- 
tance is a measure for how much these two distributions 
differ: 



E„ 



Y,\P Q M(a\x)-P(a\x)\^ 



The inclusion of the factor C /rf 1 takes care of the possible 
finite efficiency of the detectors, assumed to be the same 
for Pqm{o\x) and for P(a\x). 

We will be particularly interested in quantum corre- 
lations that exhibit "pseudo telepathy", i.e., such that 
Pqm{cl\x) = for some a and x. For such correlations it 
is convenient to define the error probability as follows. 



Definition 8 Let 

F:={(a,x):P QM (a\x)=0} 

and again we also denote by F the indicator random vari- 
able of the event Pqm(cl\x) = 0. The error probability 
is 



E„ 



Thus e is the probability to observe in one run an event 
that cannot occur in the quantum mechanical model. It 
is immediate to check that 

For an (n,k,£) correlation problem P(-\x) with input 
distribution /1, we denote by rf the maximum detector 
efficiency of any lhv model that reproduces the quantum 
correlations, and by rj* the maximum detector efficiency 
that reproduces the quantum correlations up to error 
e. Similarly, we can define D £ , R £ , i?P ub the amounts 
of communication required to reproduce the correlation 
problem P in the presence of error. We are interested in 
77* and by i? pub . 

We can map every communication model with c bits 
of communication with shared randomness into a model 
with inefficient detectors with efficiency rf 1 = 2~ c : the 
shared randomness determines the conversation between 
the parties. Thus they all agree on the conversation. 
Each party i checks whether its input Xi is compatible 
with the conversation and, if yes, produces output <2j ac- 
cording to the communication model and otherwise pro- 
duces no output, i.e., _L. The total probability that all 
detectors click is equal to the probability that x belongs 
to the conversation. Since each input belongs to one and 
only one conversation, the probability that all detectors 
click is equal to one over the number of conversations. 
Note that in this model the probability that a specific 
detector, say detector i, clicks may depend on the in- 
put Xi. However, the probability that all detectors click 
remains independent of the input. 

Theorem 1 Consider lhv models where the probability 
that all detectors click is independent of the input, but 
where the probability that each detector clicks, say detec- 
tor i, may depend on its input Xi. Then there exists a 
lhv model if the probability rf 1 that all detectors click is 
at most 2~ RP . This implies that in these models, 



> 2 



-R p 



(2) 



This result was given in 15] in the absence of error, but 
it also holds when errors are present. 



III. COMBINATORIAL BOUNDS 

We now introduce some definitions and notation, which 
allow us to state and then prove our result concerning a 
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general relation between c, rj and e. We are concerned 
with pseudo-telepathy type correlations for which there 
are some P(a\x) that vanish. 

Definition 9 Let P(-\x) be a fixed (n,k,£) correlation 
problem with input distribution fi. We define the sets of 
inputs that admit output a as 

adm(a) := {x : P(a\x) > 0} 

for all a G C. Moreover, for a set S C {l,...,/c} n 
of inputs and a specific output a G {1, . . . ,£} n , the a- 
advantage of S is 



adva(S') 



(i(Sn adm(a)) 



for all a G C. 

For sets A\, . . . , A n , a subset R of the Cartesian prod- 
uct A\ x • • • x A n is called a rectangle if there are R\ C Ai , 
■ • • ! -Rn C A„ such that R — Ri x ■ ■ ■ x R n , i.e., i? is a 
Cartesian product itself. The importance of rectangles 
is that for a deterministic lhv model A = (Ai, . . . , A„), 
the set R\(a) := {x : X(x) = a} of all inputs x leading to 
output a is a rectangle: R\(a) = A^f (ai) x • • • x \^ l 1 (a n ). 

Theorem 2 Let P be a fixed (n, k, t) correlation problem 
with input distribution fj,. If for some S (0 < 5 < 1), all 
rectangles R with adv a (i?) > S have fi(R) < r for every 
a G C , then for every classical model v(V) with c bits of 
communication holds 



< £ n r. 



This shows the strong relation between the detection 
efficiency and the amount of classical communication re- 
quired to reproduce the correlations. Indeed one quantity 
can be traded for the other. 

Proof of Theorem |5J Let R-p >Vt a denote the set of in- 
puts x for which the deterministic protocol V terminates 
in leaf v and outputs a. Every Rj> tVta is a rectangle. Let 
L := {(V,v,a) : &dv a (R v , v>a ) > d}. Then 



T) n (l-e)=Y,v{'P)ti*)C{l-Jn 



V,x 

^2 v{V)ii(R v , v ,a n adm(a)) 
^2 v(V)n(Rv,v,a)&&v a (R v 

,u,a } 

(V,v,a)eL (V,v,a)£L 
< 2 c d n r + S J2 v (PM R r,v,a) 

where the v range over the leafs of V and the a over 
{!,..., £} n . Similarly 



< 



t\'-e 



v{V)li(x)CF 



v(V)n(Rp,v, a n ({1, . . . , k} n \ adm(a)) 
Y v(V)n(R-p lV>a ){l - adv a (R v , Via )) 



>0+ v(V)KRv,v,a){l - 6) 

= (1-5) V (PMRp,*,a) 



Hence, 



rf (1 - e) < Ttr + - V n e , 

l — o 



which implies Theorem [3] 



IV. APPLICATION TO THE GHZ 
CORRELATIONS 

In this measurement scenario each of the n parties has 
a two-dimensional quantum system. The overall state of 
the n qubits is 



|0") + |1") 
V2 



(3) 



where \i n ) = \i) (8 • • • <8> \i) with n terms in the product. 
Each party receives as input Xi G {0, . . . , k — 1}. Each 
party then measures his qubit in the basis 



|Q)±e 7ria «/ fe |l) 

71 



(4) 



If the qubit is projected onto state \<p+), then party i 
outputs ai = and if the qubit is projected onto state 
\ip— ), party i outputs a* = 1. 
We call an input x = (x\, . . . , x n ) valid if it satisfies 




Xi mod k = 



(5) 



and we let DcZJ denote the set of all valid inputs. Let 
F : Z£ — > {0, 1} denote the Boolean function on the valid 
inputs defined by 




Xi mod 2k 



F(,) . I 



The function F can be viewed as computing the (1 + 
logfc)-th least significant bit of the sum of the Xi. 

It is easy to check that the outputs of the quantum 
measurement are correlated as follows: if Eq. (JjjJ holds, 
then 



a, j mod 2 — — 



k 




Xi mod 2k 



F(x) 



Hence, if each party broadcasts its measurement outcome 
then each party can locally compute F(x). 
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Lemma 3 In the model with prior entanglement and 
classical broadcast communication, the communication 
complexity of computing F(x) is 0(n). 

Moreover, the above measurement scenario will exactly 
reproduce the following (n, k, 2) correlation problem (see 
Definition QJ : 

Definition 10 Let fi(x) be a distribution on the inputs 
that gives zero weight to the invalid inputs x, which do 
not satisfy Eq. and let 



P(a\x) 




if F(x) = a 1 
otherwise. 



mod 2 



for all a G {0, 1}" and x G D. 

A simple classical strategy for reproducing these corre- 
lations is for every party to broadcast its input. Hence, 
with k — n 1 / 6 , the communication problem and the cor- 
relation problem can be solved exactly with O(nlogn) 
bits of communication. 

Note that for n = 3 and k = 2 the above correlations 
constitute the GHZ paradox as formulated by Mermin Q . 
The case k — 2 and arbitrary n was studied by Mer- 
min 0| and recently revisited by Brassard et al. [2(1 [^J. 
In Buhrman et al. [12j and our earlier research [15J the 
case where the number of settings k is a power of two 
was considered. In 12] it was shown that the amount c 
of classical communication which the parties must broad- 
cast in order to reproduce exactly the correlations from 
Definition 1101 is c = O(nlogn) when k = O(n). And in 
|15| it was shown that the maximum detector efficiency 
?/* for which a local classical model can reproduce ex- 
actly the correlations from DcfinitionllOldecreases as 1 /n. 
Furthermore the classical strategy described above shows 
that in the absence of noise these results are essentially 
optimal. 

We will now show that this optimality continues to 
hold in the presence of noise and that classical strategy 
described above remains close to optimal in the presence 
of noise. Specifically we will show that 

Theorem 4 Let fi be the uniform distribution on valid 
inputs. Then the number c of bits broadcast, the effi- 
ciency r\ and the error e of every lhv model v are con- 
strained by 

x/n 

2 + ' 



1 

2 c / r • 



7? l-£ 



1 



= o 



1 

TT/e 



which, for fixed e, n large, implies Eq. In particular 
we have 

Corollary 1 Every bounded-error randomized public 
coin protocol for F : Z£ — > {0, 1} with k > n 1 / 6 requires 
Q(nlogn) bits of communication. 

We now turn to the proof of Theorem 0] We say a 
rectangle R = A\ X • • • X A n C k n involves m parties if 
at least m of the n subsets Ai have size at least 2. Every 
rectangle involving at most m parties can have size at 
most k m . 



Lemma 5 (Small rectangles are insignificant) 

Every rectangle R involving at most n 5 / 6 parties satisfies 
\og\R\ < ri 5 / 6 logfc. 

We say a rectangle R has bias at most 6 if 

l-F-^i) n d n R\ < (i + 5)|-F _1 (0) n d n R\ 

and 

|F _1 (o) n d n R\ < (i + n d n R\. 

Note that for every a we have adm(a) n D — F~ 1 {a\ + 
• ■ - + a n mod 2)nD. Therefore, if \x is a distribution that 
is uniform on D, then R has bias at most 8 if and only 
if it has a-advantage at most (1 + <5)/(2 + 5) for every a. 
The next lemma expresses that every "large" rectangle 
is almost unbiased. 

Lemma 6 (Large rectangles are almost unbiased) 

Every rectangle involving at least n 5 / 6 parties has bias 
at most 0(l/n 1 / 6 ). 

The proof of Lemma^lis based on addition theorems for 
cyclic groups and is given in the next section. 

Proof of Theorem |4j Lemma implies that each rect- 
angle involving at least n 5 / 6 parties can have a-advantage 
at most l/2+0(l/n 1 / 6 ) for any a. Hence, rectangles with 
a-advantage greater than 1/2 + 0(l/n 1 / 6 ) must involve 
less than n 5 / 6 parties. By Lemma such a rectangle R 
has size less than k 



,5/6 



and thus 



fi(R) = \R\/k n 



Plugging these values into Theorem |3 we obtain 



1-e 



2 + 



,1/6 



< 2 _ 5™ lo S"+°( n ) 



V. AN ADDITION THEOREM 

Let TLt denote the additive cyclic group of order T. 
Let Ha{x) denote the multiplicity of an element x in the 
multiset A. For multisets A and B of Zt, let A + B 
denote the multiset {a + 6|a£ A,b £ B}. 

Definition 11 We say a multiset A of has bias at 
most e with respect to a subgroup H ^ if [ia (a) < 
(1 + e)/_t J 4(a + h) for all a € A and all h G H . 

Theorem 7 (Addition Theorem) Let Ai, .. . , A r be 

subsets of^T, each of size at least 2, with r > T 3 and 
T = 2 t a power of 2. Then the multiset Ai+ A2 + ■ ■ - + A r 
has bias at most 0(T 3 ^ 2 /r 1 ^ 2 ) with respect to the sub- 
group {0, 2*~ 1 }. 
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Essentially, this theorem is derived by a sequence of sim- 
ple reductions to the following observation: We may gen- 
erate an almost uniformly distributed random number 
between and K — 1 by flipping a fair coin K 2 times, 
and counting the number of heads modulo K . 

Lemma 8 For multisets A and B over Zy, if A has bias 
at most e with respect to some subgroup H , then so does 
A + B . In particular, the multiset A + {d} has the same 
bias as A. 

Lemma 9 Let f : {0, 1} S — > Z/f be defined by 

f(ai, . . . , a s ) = ^2 mod K ■ 

Ifs > K 2 , then \r\x)\ < (l + 4^) \f~\y)\ for all 
x,y £ Z K . 

Proof. First suppose x < y. Then 



lr l (*)l = E 



x + iK 



x + iK 



= E 

i:y+iK<s/2 

< y ( 3 

^ \y + iK 

i:y+iK<s/2 xa 

+ E 

i:y+iK>s/2 

< y ( 3 

^ \y + n< 

i:y+iK<s/2 x 

K y + iK 



E 

i:y+iK>s/2 



S 

x + iK 



x + iK + K 



s/2 



E 

i:y+iK>s/2 



s/2 



\.r\y)\ + 



s/2 



Similarly if x > y, then still {/^(x)] < \f-\y)\ + ( s s /2 ). 
Thus, for all y G Zk, we have that is within 

(s/2) °^ ^ ne avera g e value of 2^. Hence, 



4 2 s K * , , . 



K 

7i 



from which follows 

s 



< 



5 VI _ 4 



Lemma 10 Let B\ = ■ ■ ■ = B s = {0, b} be s identical 
size-2 subsets of Z^, with s > T 2 . Then the multiset 
B\+ B 2 + ■ ■ - + B S has bias at most A\H\/s 1 ^ 2 with respect 
to the subgroup H = (b) . 



Proof. Set K = \H\ and define function / : {0, 1} S -> 
by /(01, ...,a s )= (X)i=i a m0( ^ Then we may 
generate the multiset B\ + B 2 + ■ ■ ■ + B s as b ■ /({0, 1} S )- 
Applying Lemma gives that / is almost unbiased on 
Zx and hence b ■ f is almost unbiased with respect to H . 

□ 

Lemma 11 Le£ B±, . . . ,B r be size-2 subsets of%T } with 
r > T 3 . There exists a nontrivial subgroup H ^ It such 

that Bx + B-2^ h B r has bias at most 4T 3 / 2 /r 1 / 2 with 

respect to H . 

Proof. First suppose G £?, for all z. There exists 
some nontrivial element b £ Zt such that Bi = {0, b} 
for s of the subsets, with s > r/T > T 2 . Applying 
Lemma ^| on these s subsets yields a multiset of bias 
at most 4|(6)|/s 1/2 < 4T 3 / 2 /r 1 / 2 with respect to (b). By 
Lemma |H1 adding the remaining r — s subsets to this 
multiset does not increase the bias. 

In general, we do not have that £ Bi for all i. In 
this case, observe that by Lemma |H1 adding any offset 
to a multiset does not change its bias, and thus we may 
reduce to the former case by adding an appropriate offset 
di to subset Bi such that £ Bi + {di}, for each i. □ 

Proof of Theorem [3 Let Bi A; be a random size-2 
subset of Ai, for each i. By Lemma ITT1 the sub-rectangle 
R' = B\ X • • • X B r is almost unbiased with respect to 
some nontrivial subgroup H'. Since H' is nontrivial, it 
contains H = {0, 2*~ 1 }, and hence R' is also almost unbi- 
ased with respect to H. By this selection process, every 
(fli, . . . , a r ) £ A\ x • • • x A r has the same probability 
of being selected and, hence, R itself is almost unbiased 
with respect to H . □ 

Proof of Lemma |5J Set t = |logn and T — 2* . Con- 
sider any rectangle R — A\ x • • • x A n involving at least 
r > n 5 / 6 = T 5 parties. By the Addition Theorem, the 

multiset A\-\ h A n has bias at most 0(T 3 / 2 /r 1 / 2 ) C 

□ (l/n 1 / 6 ) with respect to {0, 2' -1 }. Hence, rectangle R 
has bias at most (Xl/n 1 / 6 ), too. □ 



VI. CONCLUSIONS 

We studied experiments for validating quantum nonlo- 
cality in the presence of noise and with imperfect detec- 
tors. Specifically we concentrated on the generalization 
of the GHZ paradox to n parties previously considered 
as a quantum communication complexity problem [l^ . 

There are several directions in which one may wish to 
improve the result Eq. The first concerns the eval- 
uation of the right-hand side of this relation. A detailed 
investigation of the proof shows that the right-hand side 
becomes nontrivial only for values of n that exceed a few 
hundred. Therefore our result will not be useful for the 
moderate values of n, say, n < 10, which may be attain- 
able by real-world experiments in the next few years. It 
would be interesting to try to improve Eq. so as to 
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make it relevant for small values of n. Can the gap be- 
tween the result in the absence of noise (when the right- 
hand side is 0(n -1 )) and the result in the presence of 
noise be closed? 

Another question concerns our notion of error, which 
is not entirely appropriate to a multiparty setting: one 
expects that each party may induce an error indepen- 
dently of the other parties. Thus it would be more nat- 
ural to consider that the probability of an error goes as 
e = 1 — 8 n . We do not know whether a constraint of the 
form Eq. holds in this case. 

Notwithstanding the above directions in which im- 
provements are possible, there is a specific sense in which 
the above result can be shown to be close to optimal. 



Consider n parties who share an entangled state \if>) of 
dimension 2™. Each party's system is two dimensional, 
i.e., each party has a single qubit. Fix a total- variation 
distance e V ar- Then for any measurement scenario in- 
volving local measurements on the quantum state 
the amount of (super-luminal) communication required 
to reproduce these correlations up to total-variation dis- 
tance £ va r is at most of order nlogn, and the maximal 
detector efficiency r/* for which these correlations are lo- 
cal is of order n~ c for some constant c. This result will 
be reported elsewhere j^. It shows that the example 
considered above is close to maximally nonlocal, at least 
if one restricts oneself to a large number of parties each 
possessing a single qubit. 
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